[WIP] [Transform] Compress, decompress #333

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Draft

kylesayrs wants to merge 45 commits into main from kylesayrs/transform_status

Contributor

kylesayrs commented May 31, 2025 •

edited

Loading

idea: submodule structure handles most serialization for us
let's not couple apply_transform_config with apply_quantization_config, otherwise we'd have potential conflicts with the QuantizationMixin

somehow, we need to allow the model_compressor to know the q_config and t_config. In the case of q_config, it's actually built on the fly. That kinda works for q_config, since all the schemes are present (although you lose config_group names). That wouldn't directly work for t_config, since the schemes are still transparent.

A simple solution would be to move towards a pattern where q_config (and as a subfield, t_config) are attached as an attribute to the mode directly, then grabbed by model_compressor. This seems to make sense, I don't see many downsides

Need to decide if we want to keep the weight submodules in the compressed state. The issue is that, without saving them, then there's no way to go from compressed to decompressed. However, saving them requires extra storage and vllm has to ignore those weights

Let's not keep weight transforms, except when trainable. During decompression, let's add activation hooks (these will need to be added by quantization anyways)

---

When saving, seems like we need to extend the _tied_weights_keys attribute in order to avoid saving issues
_dynamic_tied_weights_keys

kylesayrs added 8 commits

May 30, 2025 13:40


          add utilities

d8a10ec

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          add tests

d2af054

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          add additional tests

e32d5b5

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          add utils and tests

9d0518b

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Implement transform factories

8c5a2d9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_utils' into kylesayrs/transform_fac…

809e367

…tory


          add permutations

8d613b3

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          wip: compression

70c2dfe

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs changed the base branch from main to kylesayrs/transform_permutations

May 31, 2025 04:46

kylesayrs added 2 commits

May 31, 2025 00:48


          add delete_offload_module

57d171a

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform-accelerate-utilities' into kylesayr…

d77bcef

…s/transform_factory

kylesayrs changed the title ~~[Transform] Apply, serialize, deserialize~~ [WIP] [Transform] Apply, serialize, deserialize

kylesayrs added 3 commits

May 31, 2025 00:52


          Merge branch 'kylesayrs/transform-accelerate-utilities' into kylesayr…

ab73b43

…s/transform_permutations


          Merge branch 'kylesayrs/transform_permutations' into kylesayrs/transf…

9f7d298

…orm_status


          Merge branch 'kylesayrs/transform_factory' into kylesayrs/transform_p…

4b55733

…ermutations

kylesayrs changed the title ~~[WIP] [Transform] Apply, serialize, deserialize~~ [WIP] [Transform] Apply, compress, decompress

kylesayrs changed the title ~~[WIP] [Transform] Apply, compress, decompress~~ [WIP] [Transform] Compress, decompress

kylesayrs added 13 commits

May 31, 2025 09:53


          key inverses by weight

aa7d21b

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          fix tests

6901e02

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          standardize random hadamard

47ae9fe

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_utils' into kylesayrs/transform_fac…

34f1343

…tory


          prepend input hooks

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge remote-tracking branch 'origin' into kylesayrs/transform_utils


          apply sqrt division first

68ec14e

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_utils' into kylesayrs/transform_fac…

a62418a

…tory


          use divided hadamards

b117523

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          fix typo

a46f754

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          add random option

cb1cb52

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_utils' into kylesayrs/transform_fac…

7c02bb2

…tory


          use random seeds, rename matrix multiply

02af1e9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs added 19 commits

June 5, 2025 14:24


          add deterministic generation to random matrix

f45f3e9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          fix perm math

7a7abdf

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          update docstrings

6e52894

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          update docstrings

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_factory' into kylesayrs/transform_p…

f74fe3e

…ermutations


          cleanup

92ddea9

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          cleanup 2

779956f

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_utils' into kylesayrs/transform_fac…

fbd2939

…tory


          make seed optional

dd72b6a

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_factory' into kylesayrs/transform_p…

4ae491d

…ermutations


          remove iterable check and missing return value

da19b0f

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'main' into kylesayrs/transform_permutations

7ab17ce


          Merge remote-tracking branch 'origin' into kylesayrs/transform_permut…

33df50f

…ations


          Remove unrelated changes

6e1ec39


          simplify code

938e702

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          Merge branch 'kylesayrs/transform_permutations' into kylesayrs/transf…

e4e3cdc

…orm_status


          test compression decompression

78dce63

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          wip update tests

b0b82a1

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>


          fix weight transform offloading

62fd754

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Base automatically changed from kylesayrs/transform_permutations to main

July 7, 2025 15:35

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet